Discovering CNVs from read depth analysis of next generation sequencing data
نویسندگان
چکیده
Genomic Structural Variations (SV), including Copy Number Variations (CNV), are believed to contribute significantly to variations between human individuals, and may have larger effect on phenotype than SNPs [1, 2]. Therefore, the importance of their discovery has been realized. While originally detected from analysis of aCGH array data, SVs/CNVs can now be more efficiently discovered from next generation sequencing data such as Solexa/Illumina, SOLiD, Helicos etc. Here we present a novel method to detect deletions and duplications from statistical analysis of mapping density (read depth) of short reads from next generation sequencing platforms. We have adopted a mean-shift technique originally developed in computer science for image processing [3, 4] to analysis of read depth (RD) data. This approach performs the discontinuity preserving smoothing of RD signal through kernel density estimation and the mean-shift computation and results in segmentation of RD signal across genome. Then, CNVs regions are predicted from the analysis of calculated segments.
منابع مشابه
An enhanced method for targeted next generation sequencing
Copy number variants (CNV) are a major cause of disease, with over 30,000 reported in the DECIPHER database. To use read depth data from targeted Next Generation Sequencing (NGS) panels to identify CNVs with the highest degree of sensitivity, it is necessary to account for biases inherent in the data. GC content and ambiguous mapping due to repetitive sequence elements and pseudogenes are the p...
متن کاملGROM-RD: resolving genomic biases to improve read depth detection of copy number variants
Amplifications or deletions of genome segments, known as copy number variants (CNVs), have been associated with many diseases. Read depth analysis of next-generation sequencing (NGS) is an essential method of detecting CNVs. However, genome read coverage is frequently distorted by various biases of NGS platforms, which reduce predictive capabilities of existing approaches. Additionally, the use...
متن کاملcn.MOPS: mixture of Poissons for discovering copy number variations in next-generation sequencing data with a low false discovery rate
Quantitative analyses of next-generation sequencing (NGS) data, such as the detection of copy number variations (CNVs), remain challenging. Current methods detect CNVs as changes in the depth of coverage along chromosomes. Technological or genomic variations in the depth of coverage thus lead to a high false discovery rate (FDR), even upon correction for GC content. In the context of associatio...
متن کاملCNV-CH: A Convex Hull Based Segmentation Approach to Detect Copy Number Variations (CNV) Using Next-Generation Sequencing Data
Copy number variation (CNV) is a form of structural alteration in the mammalian DNA sequence, which are associated with many complex neurological diseases as well as cancer. The development of next generation sequencing (NGS) technology provides us a new dimension towards detection of genomic locations with copy number variations. Here we develop an algorithm for detecting CNVs, which is based ...
متن کاملSensitive and accurate detection of copy number variants using read depth of coverage.
Methods for the direct detection of copy number variation (CNV) genome-wide have become effective instruments for identifying genetic risk factors for disease. The application of next-generation sequencing platforms to genetic studies promises to improve sensitivity to detect CNVs as well as inversions, indels, and SNPs. New computational approaches are needed to systematically detect these var...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009